Hierarchical Clustering Using One-Class Support Vector Machines
نویسنده
چکیده
This paper presents a novel hierarchical clustering method using support vector machines. A common approach for hierarchical clustering is to use distance for the task. However, different choices for computing inter-cluster distances often lead to fairly distinct clustering outcomes, causing interpretation difficulties in practice. In this paper, we propose to use a one-class support vector machine (OC-SVM) to directly find high-density regions of data. Our algorithm generates nested set estimates using the OC-SVM and exploits the hierarchical structure of the estimated sets. We demonstrate the proposed algorithm on synthetic datasets. The cluster hierarchy is visualized with dendrograms and spanning trees.
منابع مشابه
Fault diagnosis in a distillation column using a support vector machine based classifier
Fault diagnosis has always been an essential aspect of control system design. This is necessary due to the growing demand for increased performance and safety of industrial systems is discussed. Support vector machine classifier is a new technique based on statistical learning theory and is designed to reduce structural bias. Support vector machine classification in many applications in v...
متن کاملMining Biological Repetitive Sequences Using Support Vector Machines and Fuzzy SVM
Structural repetitive subsequences are most important portion of biological sequences, which play crucial roles on corresponding sequence’s fold and functionality. Biggest class of the repetitive subsequences is “Transposable Elements” which has its own sub-classes upon contexts’ structures. Many researches have been performed to criticality determine the structure and function of repetitiv...
متن کاملConversational Topic Classification using SVMs and Features Induced by Clustering
This work explores the use of Support Vector Machines (SVM) for topic classification of conversations. An All-vs-One SVM system is used as the baseline. Several methods in feature weight scaling and feature selection are compared. Results suggest that the conversation domain requires a different set of methods from the written text domain. Finally, a feature selection method based on hierarchic...
متن کاملDensity Clustering Based SVM and Its Application to Polyadenylation Signals∗
Support vector machines (SVM) have been promising methods for classification analysis due to their solid mathematical foundations. Clustering-based SVMs are used to solve large samples classification problems and reduce the computational cost. In this paper, we present a density clustering based SVM(DCB-SVM) method to predict polyadenylation signal (PAS) in human DNA and mRNA sequences. We decr...
متن کاملProtection or Privacy? Data Mining and Personal Data
A multiclass classification method based on output design p. 15 Regularized semi-supervised classification on manifold p. 20 Similarity-based sparse feature extraction using local manifold learning p. 30 Generalized conditional entropy and a metric splitting criterion for decision trees p. 35 RNBL-MN : a recursive naive Bayes learner for sequence classification p. 45 TRIPPER : rule learning usi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Symmetry
دوره 7 شماره
صفحات -
تاریخ انتشار 2015